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1.  Introduction  and  Summary 

Let  Xj,  x^,  ...  be  independent,  identically  distributed  random  vari 
ables  with  mean  y,  and  put  Sn  *  x^  ♦  ...  ♦  xr.  In  a  variety  of  contexts 
e.g.  sequential  analysis,  nonparametric  statistics,  queueing  theory, 
insurance  risk  theory,  one  is  interested  in  a  probability  or  expected 
value  defined  in  terms  of  the  time  at  which  Sn  first  crosses  a  boundary 
c(n).  In  general  c(n)  is  nonlinear,  but  the  linear  case  is  particularly 
inqwrtant.  Exact  results  are  possible  only  in  very  special  circumstances, 
but  one  can  obtain  asymptotic  approximations  reasonably  generally. 

One  possible  formulation  involves  contraction  of  the  time  and  space 
coordinates  by  the  factor  m"1  and  consideration  of  the  asymptotic 
behavior  of  (for  example) 

(1)  P{Sn/m  £  c(n/m)  for  some  1  £  n  £  m) 

as  ®*».  If  c(0)  >  0  and  c  is  concave,  it  is  clear  from  the  law  of 
large  numbers  that  (1)  converges  to  0  if  y  <  c(l)  and  to  1  if 
u  >  c(l).  Precise  asymptotic  approximations  to  (1)  have  been  developed 
largely  in  the  context  of  sequential  analysis.  See  [23]  or  [27]  for  a 
summary  of  results  and  statistical  applications.  Since  these  approxima¬ 
tions  involve  the  rate  at  which  a  sequence  of  probabilities  tends  to  0, 
it  seems  appropriate  to  call  them  large  deviation  approximations. 
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A  second  possibility  is  to  contract  tine  by  the  factor  m"1  and  space 

_L 

by  m  ,  so  that  instead  of  (1)  one  considers 

(2)  >_  c(n/m)  for  some  1  £  n  £  m} 

L 

with  the  additional  assumption  that  p  =  Vm  ~  C/m  for  some  fixed  -<*>  <  C 
<  <*>.  Then  under  quite  general  conditions  (2)  converges  as  or*®  to 

(3)  P{W(t)  >_  c(t)  for  some  0  <_  t  <_  l)  , 

where  W(t)  is  Brownian  motion  with  drift  £.  Unfortunately  the  approxi¬ 
mation  of  (2)  by  (3)  is  rather  crude  unless  m  is  quite  large,  so  it  seems 
interesting  to  try  to  develop  an  asymptotic  expansion  of  (2)  with  (3)  as 
the  leading  term.  Such  approximations  resemble  Edgeworth  expansions  to 
improve  the  central  limit  theorem.  They  have  been  called  corrected  dif¬ 
fusion  approximations  [21]. 

The  purpose  of  this  paper  is  to  survey  the  present  state  of  develop¬ 
ment  of  corrected  diffusion  approximations  and  to  compare  them  with  the 
considerably  more  developed  large  deviation  approximations. 

Section  2  reviews  the  ruin  problem  of  risk  theory,  for  which  the 
classical  large  deviation  approximation  is  due  to  Cramer  (cf.  Feller, 

[7],  p.  411).  Section  3  discusses  some  related  results  for  which  a  large 
deviation  formulation  is  in  a  certain  sense  inappropriate.  Section  4  is 
concerned  with  first  passage  times  to  a  linear  boundary  with  applications 
to  the  one  sample  Kolmogorov- Smirnov  statistic,  truncated  sequential  prob¬ 
ability  ratio  tests,  and  risk  theory. 

The  results  of  Sections  2-4  involve  crossings  of  linear  boundaries, 
for  wbich  corrected  diffusion  approximations  are  typically  as  good  and 


often  better  than  large  deviation  approximations.  However,  it  has  proved 
difficult  to  develop  corrected  diffusion  approximations  for  nonlinear 
boundaries,  and  it  is  only  recent  research  of  Hogan  [9]  that  has  led  to 
some  progress.  Section  5  discusses  one  particular  application  of  Hogan's 
work  to  repeated  significance  tests  for  a  normal  mean. 

2.  Ruin  of  a  Risk  Process 
For  b  >  0  define 

t  =  t (b)  »  inf{n:  Sn  b}  , 

with  the  understanding  that  inf  <p  -  **>.  In  this  section  we  shall  review 
some  known  results  concerning  P{r(b)  <  »}.  With  the  proper  interpreta¬ 
tion  this  is  the  probability  of  ultimate  ruin  of  a  risk  process  or  the 
stationary  waiting  time  distribution  of  a  single  server  queue.  See 
Feller  [7],  Chapters  VI  and  XII. 

Assume  that  the  distribution  of  the  x.^  is  of  the  form 

(4)  dFy(x)  =  exp{0x  -  iK0)}  dF(x)  , 

where  y  =  t|>*(0)  *  /  xdFy(x)  is  strictly  increasing  (unless  F  is  con¬ 
centrated  at  a  point),  so  one  can  consider  y  to  be  a  function  of  0, 
or  0  a  function  of  y  -  say  0(y).  After  a  linear  transformation  one 
can  assume  without  loss  of  generality  that 

(5)  /  xdF(x)  -  0  ,  f  x2dF(x)  -  1  . 

In  particular  Fq  ■  F.  Since  \p  is  strictly  convex,  given  any  y^  <  0 
(y^  >  0)  there  exists  at  most  one  value  y^  >  0  (y^  <  0)  such  that 
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♦ 


jp (0q)  »  (6 j ) »  where  *  Q .  We  shall  assume  that  such  a  "conjugate" 

value  of  y  exists  for  all  y  t  0. 

By  Wald's  likelihood  ratio  identity,  for  any  stopping  time  N  and 
event  A  e  SCXj,  x^) ,  for  all  y  i  y' 

(6)  (A  n  {N  <  »})  *  E^.CexptCB-e')^  -  Nty(d)-f(e')];  A  ft  {N  <  «}]  , 

where  0  *  8(y),  0'  *  0(y'),  and  E(Z;B)  =  E(ZIg).  In  particular,  if 
Uq  <  0  <  y^  are  conjugate  in  the  sense  of  the  preceding  paragraph,  then 
(6)  yields 


(7) 


(t  <  ®)  *  E  [exp{-A(S  -b)}]  exp(-Ab)  , 
a0  U1 


where  A  »  0l"6O*  From  (7),  the  renewal  theorem,  and  some  calculation,  it 
follows  that  as  b*» 


(8) 


P  (t  <  «} 
M0 


K(y0)  exp(-Ab)  , 


where,  for  nonarithmetic  random  variables. 


(9)  K(yQ)  *  P  -  »}  P  (t_  ■  “J/Ayj 

M0  1 

with  (t_)  ■  inf(n:  Sn  >  0  (Sn  <  0)}.  In  general  K(y0)  ®ust  be 

evaluated  numerically  [27],  p.  25.  The  relation  (8)  is  Cramer's  classical 
approximation  obtained  essentially  by  the  method  introduced  by  Feller  [7], 
Chapter  XII. 

Suppose  now  that  b-**,  yQ-*0  in  such  a  way  that  byQ  is  bounded 
away  from  -»  and  0.  It  is  easy  to  see  that  Ab  is  bounded  away  from 
0  and  ®.  The  following  result  appears  in  [21]. 
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(10)  Theorem.  Suppose  b-*®,  A-*-0  so  that  Ab  is  a  fixed  positive  number. 
If  F  is  nonarithmetic,  then 

P  (r  <  ®>  s  exp(-Ab)  [1  -  p+A  +  o(A)]  , 

2 

where  p+  *  h  E_(S  )/EQ(S  ).  If  F  is  strongly  nonarithmetic  in  the 

u  T+  T+ 

sense  that 

lim  inf  |l  -  [  e*^X  dF(x)|  >  0  , 

]  X  ]  -*• ®  J 

then 

(11)  P  (t  <  “}  =  exp(-Ab)  Cl  -  p+A  +  h  p2  A2  +  o(A2)]  . 

Remarks . 

2  2 

(12)  Since  1-x  +  Jjx  *  exp(-x)  +  o(x  )  as  x+0,  and  exp(-x)  is  always 
positive,  it  seems  reasonable  for  numerical  purposes  to  express  (11)  in 
the  form 


This  also  has  an  appealing  interpretation,  since  exp[-A(b  +  p+)]  is 
exactly  the  probability  that  Brownian  motion  with  drift  ever  crosses 

the  level  b  +  p+.  Most  of  the  results  of  Sections  3-5  admis  similar  inter¬ 
pretation. 

(13)  At  first  glance  (11)  is  quite  surprising,  because  an  informal  expan- 

2 

sion  of  (7)  leads  one  to  expect  the  coefficient  of  A  in  (11)  to  involve 

lim  E  C(S  -b)2]  -  E.(S*  )/3  Eq(S  ). 
b®°  M1  T  ♦  + 

(14)  The  proof  of  (11)  follows  from  (19)  -  (21)  given  below.  For  details, 
see  [21]. 
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(15)  There  is  a  version  of  (11)  in  the  case  that  F  is  an  arithmetic  dis 
tribution,  provided  b-**  through  the  span  of  the  support  of  F. 


(16)  In  view  of  (8)  and  (11),  it  is  not  surprising  to  obtain 

K(y„)  =  exp(-Ap+)  *  o(A2)  (A-0)  , 

which  follows  from  (20)  and  some  easy  calculation. 

(17)  An  algorithm  for  computing  p+  numerically  is  given  in  [21] .  For  the 

-h  2 

special  case  dF(x)  *  (2tt)  expf-Jj  x  ), 

foo 

X-2  log{2[l-exp(-Js  X2)]/X2}  dX  5  .583  . 

JO 

(18)  Because  of  (16)  one  expects  (and  is  supported  by  experience)  to  find 
that  (8)  and  (12)  give  similar  numerical  results.  The  main  advantage  of 
(12)  is  that  it  requires  numerical  evaluation  of  only  a  single  constant. 

The  results  of  Section  3  provide  examples  where  the  asymptotic  normalization 
of  (8)  is  difficult  to  justify,  but  precise  analogues  of  (12)  are  easily 
obtained. 

Let  W(t),  o  £  t  <  °°,  denote  Brownian  motion  with  drift  P  and  put 
TW  =  TW(b)  =  inf{t:  W(t)  -  b}  • 

let  G(t;  p,b)  =  P  {xw(b)  <  t} .  Suppose  to  be  definite  that  F  is  non- 
P  "  — 

arithmetic  and  let 

H(x)  =  (E  S  )_1  [  P  (S  >  y}dy  . 

T+  J (0,x)  u  T+  ~ 

The  following  lemmas  yield  (10)  as  an  immediate  consequence. 
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(19)  Lemma.  Suppose  F  is  nonarithmetic.  Let  y-*-0,  b-*»  in  such  a  way 
that  for  some  -<*>  <  C  <  °°»  yb  -*■  £.  Then  for  all  0  <_  t,  x  <_<*> 

lim  P  {t(b)  <  b2t,  ST-b  <  x}  =  G(t;  5,1)  H(x)  , 

and  for  all  r  >  0 

lim  Eu((ST-b)r;  t  <  »}  =  E0(S*+1)/(r+l)  Eq(St  )  . 

(20)  Lemma.  Assume  that  F  is  nonarithmetic.  Let  r  0,  y  >  0.  Then 

lim  yE  (t  V  }  =  E0(sJ;+1)/Cr+l) 
y+0  y  +  + 

and  consequently  as  y+0 

Ey(S^  )  =  Eq(S^  )♦  6r  E0(S*+1)/(r+l)  ♦  o(y)  , 

where  0  *  0(u)  • 

(21)  Lemma.  Suppose  F  is  strongly  nonarithmetic.  There  exists  an 
e  >  0  such  that  uniformly  for  y  e  [0,e] 

EyC(sT-b)j3  *  Ey(sj+1)/OU  Ey(Sx  )  ♦  0(e'eb) 

for  j*l,  2,  3. 

3.  Two-Sided  Problems 

Consider  the  stopping  rule  of  a  sequential  probability  ratio  test 

(22)  T  =  inf{n:  Sn  *  [a,b]}  , 
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I 


where  a  £  0  <  b.  If  b*»  and  &*■-<»,  then  for  fixed  yQ  <  0,  Py  (ST  £  b} 

~  P  {r(b)  <  <»} ,  so  a  large  deviation  approximation  does  not  distinguish 
y0 

between  this  situation  and  that  of  Section  2.  In  [18]  I  have  a  heuristic 

approximation  to  P  {S  >  b)  which  did  not  lose  sight  of  the  lower 
P0  T 

boundary  at  a,  but  which  seems  impossible  to  justify  in  general.  The 
following  approximations  do  have  precise  mathematical  justification  and 
agree  with  the  heuristic  approximations  of  [18]  when  y-K). 


(23)  Theorem.  Assume  that  F  is  nonarithmetic.  Suppose  b*“,  ar*--“>,  and 

A  *  0  in  sucb  a  way  that  Ab  40(1  b/lal  are  fixed  positive 

numbers.  Then  for  j=0  or  1 

1  -  exp[(-l)^  A(a+p  )] 

(24)  P  (ST  >  b}  *  - 5 - - - ♦  o(A)  , 

yj  1  exp[ (-1)^  A(b+p+) ]  -  exp[(-l)J  A(a+p_)] 

where  p+  *  H  E0(S^  ^eo^St  ^  and  T+  ("O  =  inf^n:  Sn  >  0  ^Sn  — 

Also 

E  (T)  =  yT^a+p  )  ♦  (b+p+  -  a-p  )  p*]  o(A-1)  . 

JJj  -> 

where  p*  denotes  the  right  hand  side  of  (24).  If  F  is  strongly  nonarith- 

2 

metic,  the  remainder  in  (24)  is  o(A  ). 


Remarks . 

(25)  Like  (12),  (23)  has  an  interesting  interpretation  in  terms  of  Brownian 
motion:  The  approximations  would  be  exact  for  Brownian  motion  with  drift 

£  *sA  stopped  upon  hitting  the  boundaries  b  ♦  p+  or  a  ♦  p_. 

(26)  For  the  case  y*0,  see  [18], 
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(27)  Example. 

Suppose  y^,  y —  are  independent  with  probability  density  function 

Ae”*x  for  x  ^  0.  A  sequential  probability  ratio  test  of  H^:  A  *  A^ 

against  H^:  A  =  A^  (A^  <  A^)  is  defined  by  a  stopping  rule  of  the 

form  (22)  with  S  =  \  (1  -  A*y.),  where  A*  =  (A(1)  -  A(0))/log(A(1)/A(0)) . 
n  1 

For  this  test  the  classical  approximations  of  Wald  (26]  for  the  error  prob¬ 
abilities  and  expected  sample  size  are  given  by  (23),  but  with  p+  =  p  =  0. 
From  the  lack  of  memory  property  of  the  exponential  distribution  it  follows 
that  ST  is  exponentially  distributed  and  hence  that  p_  =  -1.  An  easy 
calculation  based  on  the  Wiener-Hopf  factorization  ([7],  p.  605)  shows  that 
p+  =  1/3.  Hence  (23)  provides  a  theoretical  explanation  for  the  modifica¬ 
tions  to  Wald's  approximations  suggested  independently  in  [8]  and  [12]. 

(The  modification  of  a  to  a-1  is  also  easily  justified  by  relating  this 
test  to  an  equivalent  one  for  the  mean  of  a  Poisson  process  observed  con¬ 
tinuously,  for  which  there  is  no  excess  at  the  lower  boundary.  But  the 
suggestion  to  replace  b  by  b  +  1/3  has  apparently  been  justified  on 
the  basis  of  numerical  evidence.) 

Now  suppose  that  T  is  defined  by  (22)  with  a=0.  This  stopping  rule 
arises  in  the  study  of  cusum  control  charts  [5],  where  one  quantity  of 
particular  interest  is  E^(T)/P^(S^,  >.  b) ,  the  so-called  average  run  length. 

(28)  Theorem.  Suppose  a *0  and  F  is  nonarithmetic.  Let  and 

A-*-0  so  that  Ab  is  a  fixed  positive  number.  Then  for  j=0  or  1 

E  (T)/P  (S  >  b)  *  (A j u - 1 ) ” 1  (exp[(-l)j  A(b+p+  -  p  )] 

j  j  J 

-  (-l)j  A(b*p+  -  P.)  -  1)  ♦  o (A~ 1 )  , 

where  p+  and  p_  are  defined  as  in  (23) . 
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Remarks . 


(29)  To  see  that  this  approximation  provides  excellent  numerical  results 
it  can  be  compared  with  the  exact  numerical  computations  of  van  Dobben  de 
Bruyn  [5]  for  the  case  of  the  normal  distribution  and  Lorden  and  Eisenberger 
[15]  for  the  exponential  distribution. 

(30)  The  proof  of  (28)  uses  (20)  repeatedly;  see  [23]  for  details. 

(31)  An  interpretation  of  (28)  in  terms  of  Brownian  motion  involves  the 
replacement  of  b  by  b+p+  -  p  .  Without  this  correction  the  approximation 
is  very  poor  for  y  <  0. 

-h  2 

Assume  now  that  dF(x)  =  (2ir)  exp(-H  x  ) ,  so  the  distribution 

of  x.,  Xj,  ...  is  normal  with  mean  y  and  variance  1.  A  special  but 
particularly  interesting  case  of  a  class  of  sequential  tests  proposed  by 
Anderson  [  ]  is  based  on  the  stopping  rule 

T  =  inf{n:  |Sn|  _>  b  -  h  p)  (p  >  0)  . 

Anderson  studied  these  tests  for  Brownian  motion.  In  general  a  corrected 
diffusion  approximation  for  the  error  probabilities  is  quite  complicated, 
but  for  y  =  -p  (with  a  similar  result  for  y=n),  if  b-*°°,  p-*-0  so  that 
bp  is  fixed,  then 

(32)  P-q^ST  >  =  **  exp[-q  (b+p+)  ]  +  o(p)  , 

where  p+  5  .583  (cf.  (17)).  It  is  interesting  to  compare  (32)  with  the 
approximation  of  Lorden  [14],  who  on  the  basis  of  numerical  computations 
and  curve  fitting  suggested 


exp(-nb)  [.4996  -  .28645  q  ♦  .0696  q^]  , 

which  in  the  range  he  explored  is  extremely  accurate.  For  p+  *  .583,  the 
right  hand  side  of  (32)  equals 

exp(-qb)  [.5  -  .2915  q  +  .0850  q2  +  o(q2)]  , 

and  numerically  (32)  is  usually  even  closer  to  Lorden's  approximation  than 
this  expansion  suggests. 

4.  The  Distribution  of  t(b). 

Assume  that  x^,  x^,  ...  are  independent,  identically  distributed  random 
variables  with  mean  0,  variance  1,  and  finite  third  moment  k.  The  expon¬ 
ential  family  assumption  (4)  will  be  introduced  when  needed.  Let  = 

x,  +  ...  +  x  and 
1  n 

t  =  inf{n:  S  >  b)  . 
n  — 

Let  f(X)  =  E[exp(iX  x^)],  and  assume  that  for  some  nQ  _>  1 

(33)  |  |f(X)|  0  dX  <  »  . 

J  -.00 

It  follows  from  (33)  that  for  all  n  n^  Sn  has  a  bounded  density  func¬ 
tion  which  obeys  a  variety  of  local  limit  theorems  (cf.  (45)  below).  It 
will  be  convenient  to  use  the  notation 

pW(A)  *  PCA|Sn  ■  5)  ,  A  .  i;i, . *„)  • 

u  u 

(34)  Theorem.  Suppose  b  =  m  T,  and  £  =  m  £q  for  some  £  >  0  and 

-®  <  <  £.  Then  as  nr*» 
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(35)  P^m)(x  <  m}  =  exp{-2(b+p+)(b*p+  -  ?  -  ic/3)/(m  +  k£/3)}  +  ofm'**)  . 
If  in  addition  c  =  for  some  Y  £  ?»  then 


(36) 


P{t  <  m,  S  <  c)  = 
m 


/c  +  k/3  -  2(b+p  )\  , 

=  *[ - n — ~ )  ♦  o(m^) 

\  (m  +  <c/3)^  / 


3  2 

Here  k  =  E(x^),  p  =  ij  E(S  )/E(S  ),  and  4>  denotes  the  standard  normal 

+  T+  T+ 

distribution  function. 

(37)  Theorem.  Now  assume  that  the  distribution  of  has  the  exponential 
form  (4)  and  that  for  some  n^  £  1  and  all  y  in  a  neighborhood  of  0 


nn 

|E^[exp(iX  x^)]|  dX  <  00 


Let  yQ  <  0  <  \i^  be  such  that  <{>(8^  =  ij>(90)»  where  9i  =  6(y^),  and  put 

A  =  Suppose  that  b  =  A,  c  =  A  for  some  5  >  0,  Y  £  C»  and 

u 

that  m  A  =6  is  a  fixed  positive  constant.  Then  as  m*»,  for  j®0  or 
1 


P  {r<m,  S  <c)  =  expC-t-l)"5  A(b+p  )]  $(- 
y.  m  r  +  \ 


+  o(m  )  . 


c+k/3  -  2 (b+p  )  •  A 

^ -  +  h(.-lr  A(m-Hcc/3 VJ 


(m-ncc/3) 


Remarks . 

(38)  Guided  by  the  results  of  Sections  2  and  3,  the  approximations  (35)- (37) 
have  been  expressed  insofar  as  possible  to  resemble  analogous  exact  results 
for  Brownian  motion.  In  fact,  they  are  precisely  the  corresponding  results 
for  Brownian  motion  with  drift  0  or  £  b  replaced  by  b  ♦  p+,  £(c) 
replaced  by  (  ♦  </3  (c  ♦  </3) ,  and  m  replaced  by  m  ♦  £k/3  (m  ♦  <c/3) . 
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(39)  The  approximations  (36),  (37)  are  similar  but  not  identical  to  those 
suggested  in  [21]  on  the  basis  of  some  heuristic  Laplace  transform  inver¬ 
sions.  To  illustrate  the  difference  consider 

(40)  Py(x  <  m}  =  P^{Sm  >  b)  +  Py(x  <  m,  <  b}  . 

Suppose  for  simplicity  that  F=$  is  standard  normal,  so  Fy(x)  *  $(x-y). 

Then  the  first  term  on  the  right  hand  side  of  (40)  is  exactly 

-h  h 

l-$(bm  ^  -  yin  ),  and  the  second  can  be  approximated  by  (37)  to  obtain 

Py(r  £  m>  S  l-$(bm-li  -  jam*5)  exp[2y (b+p+) ]  $[- (b+2p+)m"'5  -  yin*]  , 

where  p+  =  .583.  The  approximation  suggested  in  [21]  is 

_!*  u  -U  if 

Py(x  £  m)  =  l-4>[(b+p+)m  -  ym  ]  exp[2y(b+p+)]  $[-(b+p+)m  ^  -  ym  ]  . 

-h 

The  differences  between  these  approximations  is  o(m  ),  so  both  are  con¬ 
sistent  with  (37).  However,  in  view  of  (40),  the  first  one  would  seem 
preferable,  and  some  numerical  experimentation  shows  it  is  typically  more 
accurate  -  especially  for  small  values  of  m. 

(41)  Asymptotic  expansions  essentially  equivalent  to  (36)  and  (37)  were 
obtained  in  [24];  but  the  method  of  proof  used  there  broke  down  in  the  most 
interesting  special  case  c=b  (y=C) . 

Examples. 

(42)  Let  Fn(x)  denote  the  empirical  distribution  function  of  n  inde¬ 
pendent,  uniform  random  variables.  The  representation  of  uniform  order 
statistics  in  terms  of  exponentially  distributed  random  variables  (e.g. 

[3],  p.  285)  shows  that 
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-  (n+1)  *  -1}  , 


P{  sup  (x  -  F  (x))  >  n~\}  =  P{  max  (W.-j)  >  n\  -  1  { W  . 

(Xx<l  n  l£j<n  J  n 

where  W.  *  +  ...  *  with  y^  y2>  ....  yfl+1  independent  standard 

exponential  random  variables.  Putting  a  =  n+1,  £  =  -1,  b  *  n  5-1,  and 
p+  *  1  (because  ST  is  standard  exponential)  in  (35)  yields 

P{  sup  n5s(x  -  F  (x))  >  5}  *  exp[-25(5  ♦  (Sn^)"1)]  ♦  o(n_ls)  , 

0<x<l  n 

a  result  first  derived  by  Smirnov  [253.  This  approximation  is  much  simpler 
and  almost  as  accurate  as  the  large  deviation  approximation  given  in  [22]. 

See  [22]  or  [29]  for  numerical  examples. 

(43)  The  approximations  (34),  (SS),  and  similar  results  for  two-sided 
stopping  rules  can  be  used  to  calculate  error  probabilities  for  truncated 
sequential  probability  ratio  tests.  See  [22]  and  [23]  for  numerical  examples 
for  normal  and  exponential  distributions, 

(44)  Asmussen  [2]  has  studied  various  approximations  to  the  probability 
of  ruin  before  time  t  for  a  risk  process,  and  has  concluded  that  an 
approximation  similar  to  (37)  is  better  than  several  alternative  proposals. 

Proof  of  (34) . 

Let  fn  denote  the  probability  density  function  of  Sn,  which  by  (33) 
exists  for  all  n  n^.  According  to  [16],  p.  207. 

(45)  f^xn^n1*  «  $(x){l  ♦  (tc^n*)  (x3-3x)}  ♦  (1  ♦  |x| V1  o(n_,J)  , 
where  o(»)  is  uniform  in  x. 

Let  P^( A)  -  P(A|Sq  *  A,  Sb  *  £).  Set  \  «  m*  \Q  and  £  -  ***  £Q 

u 

for  some  XQ,  £Q  <  Let  5'  ■  2b-£  *  m  (25-£Q)  denote  £  reflected 


I 
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about  b.  From  (45)  and  the  renewal  theorem  it  follows  as  in  Lemmas  1-3 

_2 

of  [24]  that  for  m,  *  m[l  -  (log  m)  )]  and  some  c  -*-0 
l  in 

(46)  p5°1{t  <  m}  -  P,(nJ{x  <  m.,  S  -b  <  m^  e  }  *  o(m“,i)  . 

A  A,  ^  it  ra 

and 

(47)  P^m^,{T<m}  -  P^a^t{T<m1,  ST~b  <  m*5  effl}  *  o(m-5s)  . 

Set  A  =  (t  <  m.. ,  S  -b  <  m*5  e  },  and  let  fc^(n,  S  )  denote  the  likeli- 

hi  it  n  Ti 

hood  ratio  of  x. ,  . . . ,  x  under  pfml  relative  to  P^m£, .  For  all 

l  n  a,  5  a, 5’ 

n  £  m-nQ 


(n,  Sn) 


f_  n(£-S  )  f  (C’-X) 
m-n  n  m 


By  (46)  and  Wald's  likelihood  ratio  identity 

(49)  PX^{T  <  *  E^m|t{a(m)(T,  ST);  Affl)  +  ofa*)  . 

Substitution  of  (48)  into  (49)  and  expansion  with  the  aid  of  (45)  gives 
the  first  order  result 


PJ[”^T  <  ■>  expC-2(c-XQ)  (C-e0)  J  • 

This  motivates  the  following  reformulation  of  (49)  (which  is  justified  by 
(47)  and  the  fact  that  P^n^,{x*m}  »  oOn'*5)): 

(5°)  P^{t  <  m)  -  expC-2(^-X0)  (?-£„) ]  ♦  o(m'H) 

“  (T*  V  -  •*Pt-2(C-X0)  (C-C0)];  Kj  • 
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The  likelihood  ratio  of  xj»  xn  under  P^  relative  to  P(»|SQ*X) 

is  f  (£'-S  )/f  (£'-X).  Hence  by  (48)  and  Wald's  likelihood  ratio 
m-n  n  m 

identity  once  again  the  right  hand  side  of  (50)  becomes 


(f_  TU-S)  f  tC?’-St)  |  » 

(51)  E|  f  (f-X)  expC-2(;-X0)  (C-C0)3  /  (5'-X)  ;  Am  S0  *  X  I  * 

v  n  in  p 

The  rest  of  the  proof  of  (35)  involves  use  of  (45)  to  expand  the  inte¬ 
grand  in  (51)  and  application  of  (19)  to  evaluate  the  resulting  expectations. 

To  avoid  some  tedious  algebra,  consider  the  special  case  ffl(x)  =  n  $(xn  ) 

u 

(<f>  the  standard  normal  density  function).  Let  Rm  =  ST  -  m  C.  The  inte¬ 


grand  in  (SI)  equals 


C (1  -  T/m)^  <fr(C, 


which  can  be  expanded  to  give 


(52)  -2(1  -  T/m)-5*  exp[5s(C0-X0)2  -  %(?-C0)2/(l  -  x/m)]  [(C-£0)  RB/«V  -  t/m)) 


♦  0(C1  ♦  R^]/m) 


uniformly  on  Am.  According  to  (19)  x/m  and  Rffl  are  asymptotically  inde¬ 
pendent,  converge  in  law,  and  (R  )  is  uniformly  integrable.  Also  (52) 

HI 

is  a  bounded,  continuous  function  of  x/m  on  A  .  Hence  (52)  can  be  sub- 

in 

stituted  into  (51)  and  (19)  applied  to  evaluate  the  result.  Putting 
X”0  and  performing  the  appropriate  integrations  yields  (35)  in  the  case 
of  normally  distributed  x^.  The  general  case  is  similar,  but  with  more 
complicated  algebra. 
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Formally  (36)  and  (37)  follow  by  substituting  (35)  into 


(S3)  P{t  <  m,  S  <  c}  »  [  P*jml{r  <  m)  P(S  e  d£)  • 

m  J  (-»,c)  °*C  18 

However,  some  care  is  required  to  justify  this  calculation  -  especially 
in  the  case  c=b  (y*S)i  when  5  in  (53)  can  be  arbitrarily  close  to  b. 
It  is  easy  to  see  that  (35)  holds  uniformly  on  each  compact  subinterval 
of  (-«,£);  but  if  Sq  5  (52)  is  not  necessarily  bounded,  (46)  may 

fail  to  hold,  and  indeed  the  proof  of  (35)  disintegrates. 

To  circumvent  this  difficulty,  let 

T*  =  sup{n:  n  <  m,  Sn  b)  , 

and  observe  that 

po"c{t  <  m}  a  po?c{t*  >  0}  a  p?o{t  <  m}  * 

so  to  approximate  <  m)  for  C-e  £  ?q  <  S»  it  suffices  to  consider 

P^m|(x  <  m)  for  £s0  and  C-e  <_  XQ  <  £  (recall  that  »  m  \) .  It  is 
easy  to  see  from  (50)  -  (52)  that  uniformly  for  £-e  <  XQ  £  £-m  **  , 

Pj[“J{x  <  m}  *  exp{-2?(£-X0)}  +  CHm’*5)  , 

which  suffices  to  justify  formal  substitution  of  (35)  into  (S3)  for  5 
in  a  neighborhood  of  b. 

There  remains  the  problem  of  large  5.  If  sufficiently  many  moments 
of  exist,  for  example,  under  the  exponential  family  assumption  of 

(37),  it  is  easy  to  improve  (45)  by  a  tilting  argument  and  show  that 
(35)  holds  uniformly  for  -log  m  £  £q  £  C-e.  This  suffices  to  complete 
the  proof  of  (37).  However,  under  the  minimal  assumption  of  a  finite 
third  moment  I  have  not  been  able  to  complete  the  proof  of  (36)  except 
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by  resorting  to  the  rather  different  techniques  developed  in  [24],  which 
prove  (36)  directly  (for  y  <  C)  without  going  through  (35). 

Remark .  It  is  interesting  to  compare  the  preceding  proof  of  (35) 
with  the  superficially  similar  proof  of  the  analogous  large  deviations 
result  (Theorem  1  of  [22]).  An  especially  important  difference  is  that  in 
the  large  deviation  scaling,  after  an  appropriate  use  of  Wald's  likelihood 
ratio  identity  the  distribution  of  t/m  degenerates  to  a  point  mass. 

This  means  that  if  t  were  defined  by  a  nonlinear  boundary,  one  could 
expect  to  obtain  some  results  by  linearizing  the  boundary  at  the  point 
where  the  distribution  of  r/m  concentrates.  In  the  case  of  a  diffusion 
limit  the  distribution  of  t/m  does  not  degenerate,  and  this  makes  non¬ 
linear  problems  substantially  more  difficult  than  linear  ones. 


5.  Repeated  Significance  Tests  for  a  Normal  Mean 


The  first  steps  towards  developing  corrected  diffusion  approximations 


for  first  crossings  of  nonlinear  boundaries  have  been  taken  by  Hogan  [9], 
who  gives  partial  generalizations  of  (36)  and  (37).  This  section  describes 
one  possible  application. 

Let  x^,  x2>  ...  be  independent  N(y,l)  random  variables,  and  put 
Sn  *  Xj  ♦  . . .  ♦  xn-  A  (modified)  repeated  significance  test  of  H^:  y*0 
against  H^*  \ifO  is  defined  as  follows.  Let  0  <  c  £  b  <  ®,  0  <  m^  <  m^ 

<  ®,  and  define 


inf{n:  n  £  mQ, 


[S  |  >  bn**} 
n  — 


Stop  sampling  at  min(T,m)  and  reject  HQ  if  either  T£  m  or  T  >  m 

L 

and  jSj  >  cm  .  The  design  parameters  c,  b,  mQ,  and  m  are  customarily 


chosen  to  give  a  desired  significance  level  and  desired  power  at  some  chosen 
y t  0,  i.e.  to  give  prescribed  values  of 


(55)  P  {T  <  m)  +  P  (T  >  i,  |S  I  >  cnfr 

y  —  y  1  m  — 

=  P{|s|  >  cm5*}  P  (T  <  m,  |s  I  <  cm*4} 
y  m1  —  y  —  nr 

for  y*0  and  some  chosen  y?<0.  Usually  either  c  =  b  *  bj,  say,  and  (55) 
simplifies  to 

Py(T  <  m}  , 

or  b  is  slightly  larger  than  b^  and  c  considerably  smaller,  so  that 
the  test  has  essentially  the  same  power  function  as  a  fixed  sample  test. 

See  [23]  for  a  thorough  discussion. 

In  light  of  (36)  and  (37)  a  natural  conjecture  is  the  following.  (See 

also  [4].)  Suppose  mQ,  nr*®  so  that  the  ratio  m/mQ  remains  constant, 

-h 

and  assume  that  y  is  proportional  to  m  .  -Define 

(56)  T  *  inf(t:  t  _>  m^,  |W(t)|  _>  fat*4  +  p+)  , 

where  p+  is  as  defined  in  Section  2  and  has  approximately  the  numerical 
value  .583.  Then  as  nr*®  (with  b,  c  fixed) 

(57)  Py{T  <  m,  |SJ  <  cm*4}  *  Py(T  <  m,  jw(m)|  <cm*4}  ♦  o(m_J4)  . 

Hogan's  elegant  argument  [9]  lends  considerable  credence  to  (57).  It 
should  be  noted,  however,  that  although  Hogan's  results  are  not  restricted 
to  normal  populations,  they  do  require  that  the  x^  have  vanishing  third 
central  moment.  In  the  contrary  case  the  form  of  a  corrected  diffusion 
approximation  for  nonlinear  boundaries  appears  to  be  much  more  complicated. 
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To  use  (57)  one  oust  compute  numerically  or  by  means  of  some  approxima¬ 
tion  the  boundary  crossing  probability  for  Brownian  motion  which  appears  on 


the  right  hand  side  of  (57).  Techniques  for  obtaining  such  approximations 
have  been  developed  in  [10],  [11],  and  [19].  For  our  present  purposes  the 
methods  described  in  [23]  aTe  convenient.  The  following  result  is  easily 
proved  by  the  methods  of  Theorems  4.14  and  11.1.  (Note  that  the  asymptotic 
normalization  used  here  is  one  of  large  deviations.) 

(58)  Theorem.  Suppose  that  b,  c,  m^,  m**  so  that  b/m  *  <  b/ja*  =  yQ, 

u 

and  c/m  =  y  <_  y^  are  fixed  constants.  Then  for  y  >  0 


(59)  P^{T  <  m,  |W(m)|  <  cm*5}  ~  exp[-m^  y(b-c)  -  p+b2/mc]  $(b-in*u)  b/cm^i  , 


and 


-  U  -t 

(60)  P  (T  <  m,  |W(m)|  <  am4}  *  2b<j>(b)  2  l  x  exp(-p+  *)dx 

v" 

•22,.  2  2 
x  -  1  +  3p+  x  -  2p+  x 


+  2b 


■1  f"°  -1 

1  <|>(b)  x  1  exp(-p  x)  [y. 4 

J  2  -1  1 


MlY 

,  -2  3  ,  -2  2  4n  . 

-  2y1  p+  x  ♦  yj  p+  x  ]  dx 


+  b"1  4> (b)  exp(-p+  yQ)  (p+y0  -  1)  [(Vq/Pj)2  -  3]  +  o[b_1  <Kb)] 


Remark.  The  expansion  (60)  is  given  in  a  convenient  form  for  numerical 
integration.  An  alternative  form  is  perhaps  preferable  if  tables  of  the 
exponential  integral  and  exponential  function  or  incomplete  gamma  function 
are  to  be  used. 

L 

The  approximation  of  Py(T  <  m,  |sj  <  car}  by  the  right  hand  side  of 
(59)  when  y  >  0  is  essentially  the  same  approximation  as  that  suggested  on 


the  basis  of  a  direct  large  deviation  analysis  of  (55)  (cf.  [20]  for  c=b 
and  [22]  for  c  <  b) .  It  is  easily  seen  to  be  very  accurate,  at  least  in 


the  case  c=b,  by  comparing  it  to  the  exact  numerical  computations  of 
Pocock  [17].  The  large  deviation  approximation  of  Pq{T  <  m,  |Sm|  <  an4}, 
to  wit  (cf.  [22]) 

u  fW°  -i 

(61)  Pn(T  <  m,  |  S  i  <  an*}  ~  b<Ji(b)  x  1  v(x)  dx  , 

0  m  ■'2-1 

V 

_  2  - 1  U  7 

where  v(x)  *  2x  exp[-2Z  n  $(-%  xn  )  =  exp(-p+  x)  +  o(x  )  as  x+0 

(cf.  (16)),  is  similar  to  the  first  term  on  the  right  hand  side  of  (60). 

The  second  order  terms  in  (60)  lead  one  to  hope  that  (60)  is  better  than 

(61),  but  on  the  basis  of  the  evidence  given  below  this  does  not  appear 

to  be  the  case,  except  perhaps  when  m^  and  m  are  quite  large. 

Table  1  compares  the  large  deviation  and  corrected  diffusion  approxi¬ 
mations  to  (55)  in  the  case  p=0.  For  the  large  deviation  case  (61)  is 
used  to  approximate  the  second  term  on  the  right  hand  side  of  (55) ,  whereas 
(60)  is  used  for  the  corrected  diffusion  case.  For  comparison,  exact 
numerical  computations  of  [6]  or  [17]  are  included  when  available.  Other¬ 
wise  the  results  of  an  8100  repetition  Monte  Carlo  experiment  +_  one  stan¬ 
dard  error  provide  a  standard  of  accuracy.  For  the  Monte  Carlo  experiment 
a  variation  of  the  importance  sampling  scheme  described  in  [13]  was  utilized 
for  variance  reduction. 

Woodroofe  and  Takahashi  [28]  have  calculated  a  second  order  term  for 
the  large  deviation  approximation  to  (55)  when  b=c.  For  the  last  two 
rows  of  Table  1,  their  approximation  yields  .111  and  .064  respectively. 
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TABLE  1 


PMSJ  >  cm*}  *  Prt{T  <  n,  i <  cm5*} 

U  ID  U  lu 


b 

c 

“o 

m 

Corrected 

Diffusion 

Large 

Deviation 

Exact  or 
Monte  Carlo 

2.289 

2.289 

1 

3 

.044 

.049 

.050 

2. 535 

2.535 

1 

9 

.045 

.050 

.050 

2.986 

2.20 

1 

5 

.031 

.032 

.032 

2.413 

2.00 

1 

5 

.064 

.069 

.072 

2.80 

2.00 

3 

9 

.049 

.050 

.051  ♦  .001 

2.24 

2.24 

4 

24 

.108 

.110 

.114  ♦  .001 

2.S9 

2.59 

20 

100 

.057 

.056 

.059  .001 
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